Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 966400 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 3158 |
| Duplicate rows (%) | 0.3% |
| Total size in memory | 645.0 MiB |
| Average record size in memory | 699.8 B |
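The derived figures in the Dataset statistics table above can be cross-checked from the raw counts. The sketch below is a minimal check and not part of the profiler's output; it assumes that 1 MiB equals 1024² bytes and that the report rounds to one decimal place.

```python
# Figures copied from the "Dataset statistics" table above.
n_rows = 966_400
duplicate_rows = 3_158
total_size_mib = 645.0

# Share of duplicate rows, in percent.
duplicate_pct = 100 * duplicate_rows / n_rows

# Average record size, assuming 1 MiB = 1024**2 bytes.
avg_record_bytes = total_size_mib * 1024**2 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the table after rounding (0.3% and 699.8 B).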
Variable types
| Numeric | 6 |
|---|---|
| DateTime | 2 |
| Categorical | 6 |
| Text | 4 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 3158 (0.3%) duplicate rows | Duplicates |
| CATEGORY is highly overall correlated with SUBCATEGORY | High correlation |
| SALES_PTR_VALUE is highly overall correlated with SALES_VALUE and 1 other field | High correlation |
| SALES_VALUE is highly overall correlated with SALES_PTR_VALUE and 1 other field | High correlation |
| SALES_VOLUME is highly overall correlated with SALES_PTR_VALUE and 1 other field | High correlation |
| SUBCATEGORY is highly overall correlated with CATEGORY | High correlation |
| SALES_VALUE is highly skewed (γ1 = 33.86988697) | Skewed |
| SALES_UNITS is highly skewed (γ1 = 57.47378806) | Skewed |
| SALES_VOLUME is highly skewed (γ1 = 34.22440987) | Skewed |
| SALES_PTR_VALUE is highly skewed (γ1 = 32.65774068) | Skewed |
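For reference, γ1 in the skewness alerts above denotes the skewness of the column. A minimal sketch of the moment-based definition follows. The function name is illustrative, and the profiler may use a bias-corrected estimator, so values from this sketch need not match the report exactly.

```python
from math import sqrt

def moment_skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / (m2 * sqrt(m2))

# A symmetric sample has zero skewness; a long right tail gives a positive value.
print(moment_skewness([1, 2, 3]))
print(moment_skewness([1, 1, 1, 10]))
```

Values in the range of 30 to 60, as reported here, point to a few very large observations dominating the right tail.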
Reproduction
| Analysis started | 2024-09-25 06:24:17.165898 |
|---|---|
| Analysis finished | 2024-09-25 06:24:45.135005 |
| Duration | 27.97 seconds |
| Software version | ydata-profiling v4.10.0 |
| Download configuration | config.json |
MNTH_CODE
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202372.88 |
| Minimum | 202309 |
| Maximum | 202408 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.7 MiB |
Quantile statistics
| Minimum | 202309 |
|---|---|
| 5-th percentile | 202309 |
| Q1 | 202312 |
| median | 202403 |
| Q3 | 202406 |
| 95-th percentile | 202408 |
| Maximum | 202408 |
| Range | 99 |
| Interquartile range (IQR) | 94 |
Descriptive statistics
| Standard deviation | 44.525843 |
|---|---|
| Coefficient of variation (CV) | 0.00022001883 |
| Kurtosis | -1.5218993 |
| Mean | 202372.88 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.68520023 |
| Sum | 1.9557315 × 10¹¹ |
| Variance | 1982.5507 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 202406 | 102585 | 10.6% |
| 202309 | 96453 | 10.0% |
| 202312 | 89583 | 9.3% |
| 202403 | 84599 | 8.8% |
| 202407 | 83813 | 8.7% |
| 202408 | 74960 | 7.8% |
| 202404 | 74939 | 7.8% |
| 202402 | 74173 | 7.7% |
| 202311 | 73399 | 7.6% |
| 202401 | 73094 | 7.6% |
| Other values (2) | 138802 | 14.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 202309 | 96453 | 10.0% |
| 202310 | 66211 | 6.9% |
| 202311 | 73399 | 7.6% |
| 202312 | 89583 | 9.3% |
| 202401 | 73094 | 7.6% |
| 202402 | 74173 | 7.7% |
| 202403 | 84599 | 8.8% |
| 202404 | 74939 | 7.8% |
| 202405 | 72591 | 7.5% |
| 202406 | 102585 | 10.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 202408 | 74960 | 7.8% |
| 202407 | 83813 | 8.7% |
| 202406 | 102585 | 10.6% |
| 202405 | 72591 | 7.5% |
| 202404 | 74939 | 7.8% |
| 202403 | 84599 | 8.8% |
| 202402 | 74173 | 7.7% |
| 202401 | 73094 | 7.6% |
| 202312 | 89583 | 9.3% |
| 202311 | 73399 | 7.6% |
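MNTH_CODE is typed as a real number, but its values look like YYYYMM month codes. This is an assumption, since the report does not document the encoding. Under that reading, statistics such as the range (99) reflect the year rollover rather than elapsed time. A minimal sketch with hypothetical helper names:

```python
def parse_yyyymm(code):
    """Split an integer such as 202309 into (year, month)."""
    year, month = divmod(code, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a YYYYMM code: {code}")
    return year, month

def months_between(start_code, end_code):
    """Number of calendar months from start_code to end_code."""
    start_year, start_month = parse_yyyymm(start_code)
    end_year, end_month = parse_yyyymm(end_code)
    return (end_year - start_year) * 12 + (end_month - start_month)

# Minimum and maximum taken from the MNTH_CODE tables above.
print(202408 - 202309)                 # numeric range reported by the profiler
print(months_between(202309, 202408))  # elapsed months under the YYYYMM reading
```

Under this reading the column covers 12 consecutive months, which matches its 12 distinct values.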
TRANS_DATE
Date
| Distinct | 303 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| Minimum | 2023-08-29 00:00:00 |
| Maximum | 2024-08-27 00:00:00 |
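The date range and the distinct count together indicate how many calendar days in the range carry no transactions. A small sketch using only the TRANS_DATE figures above:

```python
from datetime import date

first_day = date(2023, 8, 29)   # Minimum
last_day = date(2024, 8, 27)    # Maximum
distinct_days = 303             # Distinct

# Inclusive number of calendar days covered by the range.
calendar_days = (last_day - first_day).days + 1

# Days inside the range on which no TRANS_DATE value occurs.
days_without_rows = calendar_days - distinct_days

print(calendar_days, days_without_rows)
```

Whether those gaps are weekends, holidays, or missing loads cannot be told from the report alone.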
START_DATE
Date
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| Minimum | 2023-08-28 00:00:00 |
| Maximum | 2024-07-31 00:00:00 |
SALES_VALUE
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 8242 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 446.41745 |
| Minimum | 2.86 |
| Maximum | 145728.12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 MiB |
Quantile statistics
| Minimum | 2.86 |
|---|---|
| 5-th percentile | 53.57 |
| Q1 | 140 |
| median | 192.24 |
| Q3 | 450 |
| 95-th percentile | 1537.89 |
| Maximum | 145728.12 |
| Range | 145725.26 |
| Interquartile range (IQR) | 310 |
Descriptive statistics
| Standard deviation | 1053.3556 |
|---|---|
| Coefficient of variation (CV) | 2.3595754 |
| Kurtosis | 2647.1191 |
| Mean | 446.41745 |
| Median Absolute Deviation (MAD) | 87.76 |
| Skewness | 33.869887 |
| Sum | 4.3141783 × 10⁸ |
| Variance | 1109558.1 |
| Monotonicity | Not monotonic |
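Two of the descriptive statistics above are derived from the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A quick consistency check on the rounded SALES_VALUE figures follows; small differences are expected from rounding.

```python
# Figures copied from the SALES_VALUE descriptive statistics above.
mean = 446.41745
std = 1053.3556

cv = std / mean          # coefficient of variation
variance = std ** 2      # variance as squared standard deviation

print(f"CV = {cv:.4f}")
print(f"Variance = {variance:.1f}")
```

A CV above 2 means the spread is more than twice the mean, which is consistent with the skewness alert for this column.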
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 142.86 | 106972 | 11.1% |
| 53.57 | 42760 | 4.4% |
| 140 | 39597 | 4.1% |
| 138.57 | 30170 | 3.1% |
| 107.14 | 28309 | 2.9% |
| 137.14 | 24945 | 2.6% |
| 142.84 | 21903 | 2.3% |
| 280 | 12096 | 1.3% |
| 131.06 | 10428 | 1.1% |
| 163.64 | 10138 | 1.0% |
| Other values (8232) | 639082 | 66.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.86 | 2 | < 0.1% |
| 4.46 | 42 | < 0.1% |
| 7.81 | 1 | < 0.1% |
| 8.57 | 10 | < 0.1% |
| 8.65 | 1 | < 0.1% |
| 8.66 | 13 | < 0.1% |
| 8.75 | 16 | < 0.1% |
| 8.92 | 1 | < 0.1% |
| 8.93 | 254 | < 0.1% |
| 13.39 | 84 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 145728.12 | 1 | < 0.1% |
| 144803.75 | 1 | < 0.1% |
| 118027.64 | 1 | < 0.1% |
| 117676.65 | 1 | < 0.1% |
| 114606.54 | 1 | < 0.1% |
| 112931 | 1 | < 0.1% |
| 104407.27 | 1 | < 0.1% |
| 103488 | 1 | < 0.1% |
| 96727.27 | 1 | < 0.1% |
| 95594.54 | 1 | < 0.1% |
SALES_UNITS
Real number (ℝ)
SKEWED 
| Distinct | 359 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.696476 |
| Minimum | 1 |
| Maximum | 10240 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 6 |
| Q3 | 16 |
| 95-th percentile | 32 |
| Maximum | 10240 |
| Range | 10239 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 40.761336 |
|---|---|
| Coefficient of variation (CV) | 3.210445 |
| Kurtosis | 7131.9482 |
| Mean | 12.696476 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 57.473788 |
| Sum | 12269874 |
| Variance | 1661.4865 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 260404 | 26.9% |
| 1 | 131257 | 13.6% |
| 2 | 122969 | 12.7% |
| 3 | 119386 | 12.4% |
| 6 | 72771 | 7.5% |
| 12 | 71214 | 7.4% |
| 32 | 57991 | 6.0% |
| 4 | 34619 | 3.6% |
| 24 | 25479 | 2.6% |
| 8 | 17878 | 1.8% |
| Other values (349) | 52432 | 5.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 131257 | 13.6% |
| 2 | 122969 | 12.7% |
| 3 | 119386 | 12.4% |
| 4 | 34619 | 3.6% |
| 5 | 5173 | 0.5% |
| 6 | 72771 | 7.5% |
| 7 | 579 | 0.1% |
| 8 | 17878 | 1.8% |
| 9 | 601 | 0.1% |
| 10 | 2070 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10240 | 1 | < 0.1% |
| 6000 | 1 | < 0.1% |
| 5120 | 2 | < 0.1% |
| 4800 | 1 | < 0.1% |
| 4388 | 1 | < 0.1% |
| 3840 | 1 | < 0.1% |
| 3600 | 3 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3360 | 1 | < 0.1% |
| 3200 | 1 | < 0.1% |
SALES_VOLUME
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 1581 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00093144967 |
| Minimum | 1.1 × 10⁻⁵ |
| Maximum | 0.2755 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 MiB |
Quantile statistics
| Minimum | 1.1 × 10⁻⁵ |
|---|---|
| 5-th percentile | 0.000144 |
| Q1 | 0.000368 |
| median | 0.000448 |
| Q3 | 0.0009 |
| 95-th percentile | 0.00286 |
| Maximum | 0.2755 |
| Range | 0.275489 |
| Interquartile range (IQR) | 0.000532 |
Descriptive statistics
| Standard deviation | 0.0020629814 |
|---|---|
| Coefficient of variation (CV) | 2.2148071 |
| Kurtosis | 2628.5773 |
| Mean | 0.00093144967 |
| Median Absolute Deviation (MAD) | 0.000198 |
| Skewness | 34.22441 |
| Sum | 900.15296 |
| Variance | 4.2558922 × 10⁻⁶ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.000384 | 67198 | 7.0% |
| 0.0004 | 63048 | 6.5% |
| 0.000416 | 44261 | 4.6% |
| 0.0005 | 32689 | 3.4% |
| 0.000144 | 32367 | 3.3% |
| 0.0003 | 25749 | 2.7% |
| 0.00075 | 24511 | 2.5% |
| 0.000368 | 22589 | 2.3% |
| 0.000272 | 21633 | 2.2% |
| 0.000132 | 20296 | 2.1% |
| Other values (1571) | 612059 | 63.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.1 × 10⁻⁵ | 6 | < 0.1% |
| 1.2 × 10⁻⁵ | 36 | < 0.1% |
| 1.7 × 10⁻⁵ | 17 | < 0.1% |
| 1.8 × 10⁻⁵ | 6 | < 0.1% |
| 2 × 10⁻⁵ | 7 | < 0.1% |
| 2.2 × 10⁻⁵ | 22 | < 0.1% |
| 2.3 × 10⁻⁵ | 33 | < 0.1% |
| 2.4 × 10⁻⁵ | 106 | < 0.1% |
| 2.5 × 10⁻⁵ | 47 | < 0.1% |
| 2.6 × 10⁻⁵ | 50 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.2755 | 1 | < 0.1% |
| 0.256 | 1 | < 0.1% |
| 0.2464 | 1 | < 0.1% |
| 0.242 | 1 | < 0.1% |
| 0.2375 | 1 | < 0.1% |
| 0.219 | 1 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.2097 | 1 | < 0.1% |
| 0.2095 | 1 | < 0.1% |
| 0.192 | 3 | < 0.1% |
SALES_PTR_VALUE
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 2024 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 457.81468 |
| Minimum | 1.7857143 |
| Maximum | 151800 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.4 MiB |
Quantile statistics
| Minimum | 1.7857143 |
|---|---|
| 5-th percentile | 53.571429 |
| Q1 | 142.85714 |
| median | 198.18182 |
| Q3 | 450 |
| 95-th percentile | 1585.4545 |
| Maximum | 151800 |
| Range | 151798.21 |
| Interquartile range (IQR) | 307.14286 |
Descriptive statistics
| Standard deviation | 1101.7691 |
|---|---|
| Coefficient of variation (CV) | 2.4065832 |
| Kurtosis | 2447.2998 |
| Mean | 457.81468 |
| Median Absolute Deviation (MAD) | 91.038961 |
| Skewness | 32.657741 |
| Sum | 4.4243211 × 10⁸ |
| Variance | 1213895.2 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 142.8571429 | 239443 | 24.8% |
| 53.57142857 | 52969 | 5.5% |
| 285.7142857 | 50215 | 5.2% |
| 107.1428571 | 33816 | 3.5% |
| 163.6363636 | 10338 | 1.1% |
| 313.6363636 | 9855 | 1.0% |
| 336.3636364 | 9789 | 1.0% |
| 270 | 9751 | 1.0% |
| 104.5454545 | 9638 | 1.0% |
| 209.0909091 | 9229 | 1.0% |
| Other values (2014) | 531357 | 55.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.785714286 | 1 | < 0.1% |
| 4.464285714 | 42 | < 0.1% |
| 8.035714286 | 6 | < 0.1% |
| 8.928571429 | 290 | < 0.1% |
| 13.39285714 | 84 | < 0.1% |
| 16.07142857 | 5 | < 0.1% |
| 17.85714286 | 260 | < 0.1% |
| 22.32142857 | 15 | < 0.1% |
| 24.10714286 | 3 | < 0.1% |
| 26.78571429 | 90 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 151800 | 1 | < 0.1% |
| 144659.0909 | 1 | < 0.1% |
| 120436.3636 | 1 | < 0.1% |
| 117559.0909 | 1 | < 0.1% |
| 116945.4545 | 1 | < 0.1% |
| 112818.1818 | 1 | < 0.1% |
| 107636.3636 | 1 | < 0.1% |
| 105600 | 1 | < 0.1% |
| 97545.45455 | 1 | < 0.1% |
| 97454.54545 | 2 | < 0.1% |
OC_CODE
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202206.57 |
| Minimum | 202201 |
| Maximum | 202212 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.7 MiB |
Quantile statistics
| Minimum | 202201 |
|---|---|
| 5-th percentile | 202201 |
| Q1 | 202204 |
| median | 202207 |
| Q3 | 202209 |
| 95-th percentile | 202212 |
| Maximum | 202212 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.4045263 |
|---|---|
| Coefficient of variation (CV) | 1.6836873 × 10⁻⁵ |
| Kurtosis | -1.1578826 |
| Mean | 202206.57 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.0061898714 |
| Sum | 1.9541243 × 10¹¹ |
| Variance | 11.590799 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 202206 | 102585 | 10.6% |
| 202209 | 96453 | 10.0% |
| 202212 | 89583 | 9.3% |
| 202203 | 84599 | 8.8% |
| 202207 | 83813 | 8.7% |
| 202208 | 74960 | 7.8% |
| 202204 | 74939 | 7.8% |
| 202202 | 74173 | 7.7% |
| 202211 | 73399 | 7.6% |
| 202201 | 73094 | 7.6% |
| Other values (2) | 138802 | 14.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 202201 | 73094 | 7.6% |
| 202202 | 74173 | 7.7% |
| 202203 | 84599 | 8.8% |
| 202204 | 74939 | 7.8% |
| 202205 | 72591 | 7.5% |
| 202206 | 102585 | 10.6% |
| 202207 | 83813 | 8.7% |
| 202208 | 74960 | 7.8% |
| 202209 | 96453 | 10.0% |
| 202210 | 66211 | 6.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 202212 | 89583 | 9.3% |
| 202211 | 73399 | 7.6% |
| 202210 | 66211 | 6.9% |
| 202209 | 96453 | 10.0% |
| 202208 | 74960 | 7.8% |
| 202207 | 83813 | 8.7% |
| 202206 | 102585 | 10.6% |
| 202205 | 72591 | 7.5% |
| 202204 | 74939 | 7.8% |
| 202203 | 84599 | 8.8% |
DISTRIBUTOR_CODE
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.1 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 5798400 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DB0209 |
|---|---|
| 2nd row | DB0706 |
| 3rd row | DB0209 |
| 4th row | DB0209 |
| 5th row | DB0209 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| DB0110 | 278245 | 28.8% |
| DB0209 | 217421 | 22.5% |
| DB0706 | 194044 | 20.1% |
| DB0652 | 142181 | 14.7% |
| DB0655 | 134509 | 13.9% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| db0110 | 278245 | 28.8% |
| db0209 | 217421 | 22.5% |
| db0706 | 194044 | 20.1% |
| db0652 | 142181 | 14.7% |
| db0655 | 134509 | 13.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1656110 | 28.6% |
| D | 966400 | 16.7% |
| B | 966400 | 16.7% |
| 1 | 556490 | 9.6% |
| 6 | 470734 | 8.1% |
| 5 | 411199 | 7.1% |
| 2 | 359602 | 6.2% |
| 9 | 217421 | 3.7% |
| 7 | 194044 | 3.3% |
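The character table can be reproduced from the value counts alone, since each code contributes its characters once per occurrence. A minimal sketch using the counts from the Common Values table above:

```python
from collections import Counter

# Value counts copied from the DISTRIBUTOR_CODE Common Values table.
value_counts = {
    "DB0110": 278245,
    "DB0209": 217421,
    "DB0706": 194044,
    "DB0652": 142181,
    "DB0655": 134509,
}

# Each character of a code is counted once per row holding that code.
char_counts = Counter()
for code, count in value_counts.items():
    for char in code:
        char_counts[char] += count

total_chars = sum(char_counts.values())
for char, count in char_counts.most_common():
    print(char, count, f"{100 * count / total_chars:.1f}%")
```

The total matches the reported 5798400 characters, which is 966400 rows times the fixed length of 6.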
OUTLET_CODE
Text
| Distinct | 18833 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.5882212 |
| Min length | 7 |
Characters and Unicode
| Total characters | 7333257 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 382 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OL12036 |
|---|---|
| 2nd row | OL49989 |
| 3rd row | OL112160 |
| 4th row | OL175188 |
| 5th row | OL80360 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| ol128896 | 1289 | 0.1% |
| ol191061 | 1277 | 0.1% |
| ol49938 | 1243 | 0.1% |
| ol143966 | 1223 | 0.1% |
| ol11104 | 1114 | 0.1% |
| ol223486 | 1089 | 0.1% |
| ol191033 | 1085 | 0.1% |
| ol32854 | 1080 | 0.1% |
| ol80887 | 1048 | 0.1% |
| ol159815 | 1035 | 0.1% |
| Other values (18823) | 954917 | 98.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| O | 966400 | 13.2% |
| L | 966400 | 13.2% |
| 1 | 879608 | 12.0% |
| 2 | 729852 | 10.0% |
| 3 | 514930 | 7.0% |
| 9 | 513662 | 7.0% |
| 4 | 494825 | 6.7% |
| 6 | 477333 | 6.5% |
| 5 | 467004 | 6.4% |
| 8 | 459480 | 6.3% |
| Other values (2) | 863763 | 11.8% |
CITY
Text
| Distinct | 1679 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 8.7443057 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8450497 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wauwatosa |
|---|---|
| 2nd row | Huntington |
| 3rd row | Saint Augustine |
| 4th row | Redwood City |
| 5th row | Kokomo |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| city | 32161 | 2.6% |
| new | 14047 | 1.2% |
| beach | 13726 | 1.1% |
| san | 12949 | 1.1% |
| springs | 10660 | 0.9% |
| fort | 10024 | 0.8% |
| park | 9132 | 0.8% |
| west | 8548 | 0.7% |
| saint | 7971 | 0.7% |
| falls | 6906 | 0.6% |
| Other values (1634) | 1090751 | 89.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 777071 | 9.2% |
| e | 776788 | 9.2% |
| n | 643293 | 7.6% |
| o | 634555 | 7.5% |
| r | 534447 | 6.3% |
| l | 513454 | 6.1% |
| i | 509793 | 6.0% |
| t | 467095 | 5.5% |
| s | 353589 | 4.2% |
| (space) | 250475 | 3.0% |
| Other values (44) | 2989937 | 35.4% |
STATE
Categorical
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 8.5429739 |
| Min length | 4 |
Characters and Unicode
| Total characters | 8255930 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wisconsin |
|---|---|
| 2nd row | West Virginia |
| 3rd row | Florida |
| 4th row | California |
| 5th row | Indiana |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| California | 111427 | 11.5% |
| Illinois | 56753 | 5.9% |
| Massachusetts | 47597 | 4.9% |
| Connecticut | 41359 | 4.3% |
| New York | 39138 | 4.0% |
| Alabama | 38935 | 4.0% |
| Florida | 38350 | 4.0% |
| Colorado | 30124 | 3.1% |
| New Jersey | 26100 | 2.7% |
| Arkansas | 25677 | 2.7% |
| Other values (40) | 510940 | 52.9% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| california | 111427 | 10.1% |
| new | 84237 | 7.6% |
| illinois | 56753 | 5.1% |
| massachusetts | 47597 | 4.3% |
| connecticut | 41359 | 3.8% |
| york | 39138 | 3.6% |
| alabama | 38935 | 3.5% |
| florida | 38350 | 3.5% |
| colorado | 30124 | 2.7% |
| jersey | 26100 | 2.4% |
| Other values (42) | 588071 | 53.4% |
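The table above appears to count lowercased, whitespace-separated words rather than full values, which is why "new" (from New York, New Jersey and similar names) shows up as its own entry. A sketch of that tokenization, assuming simple whitespace splitting; the function name and the sample rows are illustrative and not taken from the dataset:

```python
from collections import Counter

def word_counts(values):
    """Count lowercased whitespace-separated words across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts

# Illustrative rows only.
sample = ["New York", "New Jersey", "California", "New York"]
counts = word_counts(sample)
print(counts.most_common(3))
```

Because multi-word names contribute several tokens, the word total exceeds the row count, so percentages in the word table are lower than in the Common Values table for the same value.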
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 1132803 | 13.7% |
| i | 865262 | 10.5% |
| n | 692063 | 8.4% |
| o | 663356 | 8.0% |
| s | 604292 | 7.3% |
| e | 451896 | 5.5% |
| r | 443948 | 5.4% |
| l | 435199 | 5.3% |
| t | 293046 | 3.5% |
| C | 205185 | 2.5% |
| Other values (36) | 2468880 | 29.9% |
COUNTY
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.9 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 9.0649369 |
| Min length | 5 |
Characters and Unicode
| Total characters | 8760355 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Scott |
|---|---|
| 2nd row | City Center |
| 3rd row | City Center |
| 4th row | City Center |
| 5th row | City Center |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| City Center | 507296 | 52.5% |
| Dolphin | 148154 | 15.3% |
| Orange | 86564 | 9.0% |
| Santa Cruz | 63187 | 6.5% |
| Scott | 50866 | 5.3% |
| Silver | 40951 | 4.2% |
| Spencer | 31985 | 3.3% |
| Stephens | 21727 | 2.2% |
| Sumter | 15670 | 1.6% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| city | 507296 | 33.0% |
| center | 507296 | 33.0% |
| dolphin | 148154 | 9.6% |
| orange | 86564 | 5.6% |
| santa | 63187 | 4.1% |
| cruz | 63187 | 4.1% |
| scott | 50866 | 3.3% |
| silver | 40951 | 2.7% |
| spencer | 31985 | 2.1% |
| stephens | 21727 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 1265201 | 14.4% |
| t | 1216908 | 13.9% |
| C | 1077779 | 12.3% |
| n | 858913 | 9.8% |
| r | 745653 | 8.5% |
| i | 696401 | 7.9% |
| (space) | 570483 | 6.5% |
| y | 507296 | 5.8% |
| S | 224386 | 2.6% |
| a | 212938 | 2.4% |
| Other values (13) | 1384397 | 15.8% |
STREET
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 56.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3865600 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Str3 |
|---|---|
| 2nd row | Str3 |
| 3rd row | Str2 |
| 4th row | Str5 |
| 5th row | Str4 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Str1 | 201506 | 20.9% |
| Str4 | 198939 | 20.6% |
| Str2 | 194809 | 20.2% |
| Str5 | 194441 | 20.1% |
| Str3 | 176705 | 18.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| str1 | 201506 | 20.9% |
| str4 | 198939 | 20.6% |
| str2 | 194809 | 20.2% |
| str5 | 194441 | 20.1% |
| str3 | 176705 | 18.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| S | 966400 | 25.0% |
| t | 966400 | 25.0% |
| r | 966400 | 25.0% |
| 1 | 201506 | 5.2% |
| 4 | 198939 | 5.1% |
| 2 | 194809 | 5.0% |
| 5 | 194441 | 5.0% |
| 3 | 176705 | 4.6% |
PRODUCT_CODE
Text
| Distinct | 94 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 6764800 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PRD0147 |
|---|---|
| 2nd row | PRD0016 |
| 3rd row | PRD0118 |
| 4th row | PRD0079 |
| 5th row | PRD0080 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| prd0106 | 107597 | 11.1% |
| prd0105 | 51482 | 5.3% |
| prd0147 | 43794 | 4.5% |
| prd0069 | 33926 | 3.5% |
| prd0058 | 31868 | 3.3% |
| prd0094 | 30319 | 3.1% |
| prd0015 | 29173 | 3.0% |
| prd0112 | 27877 | 2.9% |
| prd0107 | 26910 | 2.8% |
| prd0096 | 26421 | 2.7% |
| Other values (84) | 557033 | 57.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1736312 | 25.7% |
| P | 966400 | 14.3% |
| R | 966400 | 14.3% |
| D | 966400 | 14.3% |
| 1 | 641564 | 9.5% |
| 6 | 290649 | 4.3% |
| 5 | 230585 | 3.4% |
| 9 | 202083 | 3.0% |
| 2 | 187942 | 2.8% |
| 8 | 156096 | 2.3% |
| Other values (3) | 420369 | 6.2% |
CATEGORY
Categorical
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.7 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 9.9073489 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9574462 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Kids Care |
|---|---|
| 2nd row | Hair Care |
| 3rd row | Soap |
| 4th row | Perfume and Deodrants |
| 5th row | Perfume and Deodrants |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Soap | 251031 | 26.0% |
| Perfume and Deodrants | 224223 | 23.2% |
| Hair Care | 203882 | 21.1% |
| Lotion | 138579 | 14.3% |
| Kids Care | 101069 | 10.5% |
| Dental | 47542 | 4.9% |
| Wipes | 74 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| care | 304951 | 17.7% |
| soap | 251031 | 14.6% |
| perfume | 224223 | 13.0% |
| and | 224223 | 13.0% |
| deodrants | 224223 | 13.0% |
| hair | 203882 | 11.9% |
| lotion | 138579 | 8.1% |
| kids | 101069 | 5.9% |
| dental | 47542 | 2.8% |
| wipes | 74 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 1255852 | 13.1% |
| e | 1025236 | 10.7% |
| r | 957279 | 10.0% |
| (space) | 753397 | 7.9% |
| o | 752412 | 7.9% |
| n | 634567 | 6.6% |
| d | 549515 | 5.7% |
| i | 443604 | 4.6% |
| t | 410344 | 4.3% |
| s | 325366 | 3.4% |
| Other values (13) | 2466890 | 25.8% |
SUBCATEGORY
Categorical
HIGH CORRELATION 
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 62.6 MiB |
| Shampoo | 123413 |
|---|---|
| Head Lotion | 82722 |
| Soap Gels | 68794 |
| Toilet Soap | 68763 |
| Female Perfume | 60317 |
| Other values (20) | 562391 |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 10.878238 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10512729 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Baby Cream |
|---|---|
| 2nd row | Hair Oil |
| 3rd row | Medicated Soap |
| 4th row | Male Perfume |
| 5th row | Unisex Perfume |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Shampoo | 123413 | 12.8% |
| Head Lotion | 82722 | 8.6% |
| Soap Gels | 68794 | 7.1% |
| Toilet Soap | 68763 | 7.1% |
| Female Perfume | 60317 | 6.2% |
| Female Deodrant | 58640 | 6.1% |
| Body Lotion | 55857 | 5.8% |
| Liquid Soap | 50112 | 5.2% |
| Hair Oil | 48057 | 5.0% |
| Medicated Soap | 46940 | 4.9% |
| Other values (15) | 302785 | 31.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| soap | 251031 | 14.8% |
| shampoo | 151515 | 8.9% |
| lotion | 138579 | 8.2% |
| perfume | 125338 | 7.4% |
| female | 118957 | 7.0% |
| deodrant | 98885 | 5.8% |
| head | 82722 | 4.9% |
| baby | 71896 | 4.2% |
| gels | 68794 | 4.0% |
| toilet | 68763 | 4.0% |
| Other values (17) | 523706 | 30.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 1214507 | 11.6% |
| e | 1169659 | 11.1% |
| a | 1037431 | 9.9% |
| (space) | 733786 | 7.0% |
| i | 612885 | 5.8% |
| t | 490496 | 4.7% |
| m | 439604 | 4.2% |
| p | 431719 | 4.1% |
| d | 413868 | 3.9% |
| S | 402546 | 3.8% |
| Other values (26) | 3566228 | 33.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 10512729 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 10512729 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 10512729 | 100.0% |
BRAND
Text
| Distinct | 89 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.0 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 17 |
| Mean length | 8.1101076 |
| Min length | 3 |
Characters and Unicode
| Total characters | 7837608 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mint |
|---|---|
| 2nd row | Magenta |
| 3rd row | Burgundy |
| 4th row | Ivory |
| 5th row | Umber |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| shoulders | 107597 | 7.6% |
| hair | 107597 | 7.6% |
| & | 107597 | 7.6% |
| green | 55054 | 3.9% |
| garnet | 51482 | 3.7% |
| toothy | 47412 | 3.4% |
| mint | 43794 | 3.1% |
| blue | 42376 | 3.0% |
| fuchsia | 34059 | 2.4% |
| arctic | 33926 | 2.4% |
| Other values (91) | 778481 | 55.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 731404 | 9.3% |
| r | 642306 | 8.2% |
| a | 598243 | 7.6% |
| i | 477346 | 6.1% |
| o | 470321 | 6.0% |
| (space) | 442975 | 5.7% |
| l | 394044 | 5.0% |
| n | 391404 | 5.0% |
| u | 378063 | 4.8% |
| s | 327239 | 4.2% |
| Other values (39) | 2984263 | 38.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7837608 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7837608 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 7837608 | 100.0% |
Correlations
| | CATEGORY | COUNTY | DISTRIBUTOR_CODE | MNTH_CODE | OC_CODE | SALES_PTR_VALUE | SALES_UNITS | SALES_VALUE | SALES_VOLUME | STATE | STREET | SUBCATEGORY |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CATEGORY | 1.000 | 0.091 | 0.024 | 0.026 | 0.039 | 0.008 | 0.006 | 0.008 | 0.005 | 0.040 | 0.008 | 1.000 |
| COUNTY | 0.091 | 1.000 | 0.089 | 0.038 | 0.030 | 0.008 | 0.005 | 0.008 | 0.008 | 0.080 | 0.061 | 0.194 |
| DISTRIBUTOR_CODE | 0.024 | 0.089 | 1.000 | 0.011 | 0.014 | 0.008 | 0.005 | 0.008 | 0.007 | 0.089 | 0.033 | 0.071 |
| MNTH_CODE | 0.026 | 0.038 | 0.011 | 1.000 | -0.350 | -0.010 | -0.020 | -0.008 | 0.007 | 0.018 | 0.009 | 0.079 |
| OC_CODE | 0.039 | 0.030 | 0.014 | -0.350 | 1.000 | 0.006 | 0.046 | -0.007 | -0.017 | 0.013 | 0.007 | 0.064 |
| SALES_PTR_VALUE | 0.008 | 0.008 | 0.008 | -0.010 | 0.006 | 1.000 | -0.126 | 0.990 | 0.894 | 0.027 | 0.005 | 0.012 |
| SALES_UNITS | 0.006 | 0.005 | 0.005 | -0.020 | 0.046 | -0.126 | 1.000 | -0.132 | 0.067 | 0.005 | 0.005 | 0.012 |
| SALES_VALUE | 0.008 | 0.008 | 0.008 | -0.008 | -0.007 | 0.990 | -0.132 | 1.000 | 0.885 | 0.025 | 0.005 | 0.011 |
| SALES_VOLUME | 0.005 | 0.008 | 0.007 | 0.007 | -0.017 | 0.894 | 0.067 | 0.885 | 1.000 | 0.018 | 0.004 | 0.010 |
| STATE | 0.040 | 0.080 | 0.089 | 0.018 | 0.013 | 0.027 | 0.005 | 0.025 | 0.018 | 1.000 | 0.098 | 0.048 |
| STREET | 0.008 | 0.061 | 0.033 | 0.009 | 0.007 | 0.005 | 0.005 | 0.005 | 0.004 | 0.098 | 1.000 | 0.019 |
| SUBCATEGORY | 1.000 | 0.194 | 0.071 | 0.079 | 0.064 | 0.012 | 0.012 | 0.011 | 0.010 | 0.048 | 0.019 | 1.000 |
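The matrix above can be scanned for strongly related pairs. The sketch below takes a subset of its upper-triangle entries and lists the pairs whose absolute coefficient reaches a cutoff. The cutoff of 0.5 and the helper name `high_pairs` are assumptions made for illustration; the rule the profiler itself applies is not stated in this section.

```python
# Upper-triangle entries copied from the correlation table above
# (only the category and sales columns are included here).
CORR = {
    ("CATEGORY", "SUBCATEGORY"): 1.000,
    ("CATEGORY", "SALES_PTR_VALUE"): 0.008,
    ("CATEGORY", "SALES_UNITS"): 0.006,
    ("CATEGORY", "SALES_VALUE"): 0.008,
    ("CATEGORY", "SALES_VOLUME"): 0.005,
    ("SUBCATEGORY", "SALES_PTR_VALUE"): 0.012,
    ("SUBCATEGORY", "SALES_UNITS"): 0.012,
    ("SUBCATEGORY", "SALES_VALUE"): 0.011,
    ("SUBCATEGORY", "SALES_VOLUME"): 0.010,
    ("SALES_PTR_VALUE", "SALES_UNITS"): -0.126,
    ("SALES_PTR_VALUE", "SALES_VALUE"): 0.990,
    ("SALES_PTR_VALUE", "SALES_VOLUME"): 0.894,
    ("SALES_UNITS", "SALES_VALUE"): -0.132,
    ("SALES_UNITS", "SALES_VOLUME"): 0.067,
    ("SALES_VALUE", "SALES_VOLUME"): 0.885,
}


def high_pairs(corr, threshold=0.5):
    """Return pairs with |coefficient| >= threshold, strongest first.

    The default threshold is an assumed value, not taken from the report.
    """
    selected = [pair for pair, value in corr.items() if abs(value) >= threshold]
    return sorted(selected, key=lambda pair: -abs(corr[pair]))


flagged = high_pairs(CORR)
for left, right in flagged:
    print(f"{left} ~ {right}: {CORR[(left, right)]:.3f}")
```

With this cutoff, SALES_UNITS pairs with nothing, since its largest absolute coefficient in the subset is 0.132.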
First rows
| | MNTH_CODE | TRANS_DATE | START_DATE | SALES_VALUE | SALES_UNITS | SALES_VOLUME | SALES_PTR_VALUE | OC_CODE | DISTRIBUTOR_CODE | OUTLET_CODE | CITY | STATE | COUNTY | STREET | PRODUCT_CODE | CATEGORY | SUBCATEGORY | BRAND |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 202311 | 2023-11-10 | 2023-10-30 | 142.86 | 18 | 0.000432 | 144.642857 | 202211 | DB0209 | OL12036 | Wauwatosa | Wisconsin | Scott | Str3 | PRD0147 | Kids Care | Baby Cream | Mint |
| 1 | 202311 | 2023-11-09 | 2023-10-30 | 518.18 | 3 | 0.001155 | 518.181818 | 202211 | DB0706 | OL49989 | Huntington | West Virginia | City Center | Str3 | PRD0016 | Hair Care | Hair Oil | Magenta |
| 2 | 202311 | 2023-11-09 | 2023-10-30 | 186.36 | 1 | 0.000325 | 186.363636 | 202211 | DB0209 | OL112160 | Saint Augustine | Florida | City Center | Str2 | PRD0118 | Soap | Medicated Soap | Burgundy |
| 3 | 202311 | 2023-11-07 | 2023-10-30 | 1609.09 | 3 | 0.003000 | 1609.090909 | 202211 | DB0209 | OL175188 | Redwood City | California | City Center | Str5 | PRD0079 | Perfume and Deodrants | Male Perfume | Ivory |
| 4 | 202311 | 2023-11-12 | 2023-10-30 | 309.09 | 1 | 0.000500 | 309.090909 | 202211 | DB0209 | OL80360 | Kokomo | Indiana | City Center | Str4 | PRD0080 | Perfume and Deodrants | Unisex Perfume | Umber |
| 5 | 202311 | 2023-11-12 | 2023-10-30 | 142.86 | 16 | 0.000272 | 142.857143 | 202211 | DB0655 | OL113196 | Jersey City | New Jersey | Stephens | Str4 | PRD0028 | Soap | Toilet Soap | Indigo |
| 6 | 202311 | 2023-11-09 | 2023-10-30 | 133.64 | 3 | 0.000300 | 133.636364 | 202211 | DB0110 | OL11802 | Bloomsburg | Pennsylvania | City Center | Str1 | PRD0095 | Soap | Medicated Soap | Sea green |
| 7 | 202311 | 2023-11-02 | 2023-10-30 | 254.55 | 2 | 0.000500 | 254.545455 | 202211 | DB0652 | OL33490 | Narragansett | Rhode Island | City Center | Str5 | PRD0107 | Lotion | Body Lotion | Coral |
| 8 | 202311 | 2023-11-09 | 2023-10-30 | 214.29 | 12 | 0.000780 | 214.285714 | 202211 | DB0706 | OL160013 | Winslow | Arizona | City Center | Str2 | PRD0009 | Dental | ToothPaste | Toothy Coal |
| 9 | 202311 | 2023-10-31 | 2023-10-30 | 133.93 | 16 | 0.000384 | 128.571429 | 202211 | DB0652 | OL191004 | Lufkin | Texas | Orange | Str2 | PRD0106 | Hair Care | Shampoo | Hair & Shoulders |
Last rows
| | MNTH_CODE | TRANS_DATE | START_DATE | SALES_VALUE | SALES_UNITS | SALES_VOLUME | SALES_PTR_VALUE | OC_CODE | DISTRIBUTOR_CODE | OUTLET_CODE | CITY | STATE | COUNTY | STREET | PRODUCT_CODE | CATEGORY | SUBCATEGORY | BRAND |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 966390 | 202311 | 2023-11-02 | 2023-10-30 | 172.73 | 1 | 0.000385 | 172.727273 | 202211 | DB0652 | OL175440 | Salem | Missouri | City Center | Str3 | PRD0016 | Hair Care | Hair Oil | Magenta |
| 966391 | 202311 | 2023-11-07 | 2023-10-30 | 1589.09 | 16 | 0.001712 | 1672.727273 | 202211 | DB0209 | OL190869 | Stamford | Connecticut | City Center | Str1 | PRD0153 | Lotion | Body Lotion | Mustard |
| 966392 | 202311 | 2023-11-15 | 2023-10-30 | 6480.00 | 108 | 0.018144 | 6750.000000 | 202211 | DB0209 | OL222200 | Huntsville | Alabama | City Center | Str2 | PRD0086 | Lotion | Head Lotion | Peach |
| 966393 | 202311 | 2023-11-23 | 2023-10-30 | 134.29 | 16 | 0.000416 | 142.857143 | 202211 | DB0209 | OL64632 | Weston | West Virginia | Scott | Str1 | PRD0069 | Perfume and Deodrants | Female Deodrant | Arctic blue |
| 966394 | 202311 | 2023-11-24 | 2023-10-30 | 53.57 | 12 | 0.000132 | 53.571429 | 202211 | DB0655 | OL81536 | Woodward | Oklahoma | Scott | Str2 | PRD0058 | Soap | Liquid Soap | Rust |
| 966395 | 202311 | 2023-11-16 | 2023-10-30 | 268.57 | 32 | 0.000832 | 285.714286 | 202211 | DB0110 | OL81665 | Mattoon | Illinois | City Center | Str4 | PRD0069 | Perfume and Deodrants | Female Deodrant | Arctic blue |
| 966396 | 202311 | 2023-11-07 | 2023-10-30 | 134.29 | 16 | 0.000448 | 142.857143 | 202211 | DB0706 | OL65911 | Vincennes | Indiana | City Center | Str2 | PRD0094 | Perfume and Deodrants | Unisex Perfume | Mocha |
| 966397 | 202311 | 2023-11-11 | 2023-10-30 | 134.42 | 16 | 0.000448 | 142.857143 | 202211 | DB0110 | OL191975 | Yorba Linda | California | City Center | Str3 | PRD0094 | Perfume and Deodrants | Unisex Perfume | Mocha |
| 966398 | 202311 | 2023-11-21 | 2023-10-30 | 202.82 | 2 | 0.000214 | 209.090909 | 202211 | DB0706 | OL49926 | Watertown | Massachusetts | City Center | Str1 | PRD0159 | Hair Care | Hair Oil | Lily |
| 966399 | 202311 | 2023-11-08 | 2023-10-30 | 125.00 | 16 | 0.000384 | 128.571429 | 202211 | DB0652 | OL65645 | Richmond | Kentucky | Dolphin | Str5 | PRD0147 | Kids Care | Baby Cream | Mint |
Most frequently occurring
| | MNTH_CODE | TRANS_DATE | START_DATE | SALES_VALUE | SALES_UNITS | SALES_VOLUME | SALES_PTR_VALUE | OC_CODE | DISTRIBUTOR_CODE | OUTLET_CODE | CITY | STATE | COUNTY | STREET | PRODUCT_CODE | CATEGORY | SUBCATEGORY | BRAND | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 202309 | 2023-10-01 | 2023-08-28 | 8.93 | 1 | 0.000017 | 8.928571 | 202209 | DB0209 | OL65494 | Shelbyville | Tennessee | Santa Cruz | Str5 | PRD0028 | Soap | Toilet Soap | Indigo | 2 |
| 1 | 202309 | 2023-10-01 | 2023-08-28 | 8.93 | 2 | 0.000022 | 8.928571 | 202209 | DB0110 | OL112602 | Junction City | Kansas | Silver | Str2 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 2 | 202309 | 2023-10-01 | 2023-08-28 | 8.93 | 2 | 0.000022 | 8.928571 | 202209 | DB0110 | OL238594 | Dallas | Texas | Spencer | Str3 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 3 | 202309 | 2023-10-01 | 2023-08-28 | 8.93 | 2 | 0.000022 | 8.928571 | 202209 | DB0110 | OL81664 | Hattiesburg | Mississippi | Orange | Str3 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 4 | 202309 | 2023-10-01 | 2023-08-28 | 13.39 | 3 | 0.000033 | 13.392857 | 202209 | DB0110 | OL144800 | Excelsior Springs | Missouri | Orange | Str4 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 5 | 202309 | 2023-10-01 | 2023-08-28 | 13.39 | 3 | 0.000033 | 13.392857 | 202209 | DB0110 | OL222617 | Indiana | Pennsylvania | Dolphin | Str4 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 6 | 202309 | 2023-10-01 | 2023-08-28 | 13.39 | 3 | 0.000033 | 13.392857 | 202209 | DB0110 | OL33285 | Fall River | Massachusetts | City Center | Str3 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 7 | 202309 | 2023-10-01 | 2023-08-28 | 13.39 | 3 | 0.000033 | 13.392857 | 202209 | DB0110 | OL49488 | Alliance | Ohio | Dolphin | Str3 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
| 8 | 202309 | 2023-10-01 | 2023-08-28 | 17.86 | 2 | 0.000046 | 17.857143 | 202209 | DB0652 | OL96577 | West Covina | California | City Center | Str1 | PRD0027 | Dental | ToothPaste | Toothy Fresh | 2 |
| 9 | 202309 | 2023-10-01 | 2023-08-28 | 17.86 | 4 | 0.000044 | 17.857143 | 202209 | DB0110 | OL80741 | Glenview | Illinois | Scott | Str2 | PRD0105 | Perfume and Deodrants | Female Perfume | Garnet | 2 |
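A "# duplicates" column like the one above can be reproduced by counting how often each complete record occurs and keeping the records seen more than once. The sketch below is a minimal illustration under stated assumptions. The records are abbreviated to four fields (OUTLET_CODE, PRODUCT_CODE, TRANS_DATE, SALES_VALUE) and the small list is constructed by hand from values in the tables above, so it is not an extract of the dataset. The helper name `duplicate_groups` is hypothetical, and how the report derives its overall duplicate count is not shown in this section.

```python
from collections import Counter

# Hand-built mini-sample; each tuple is an abbreviated record
# (OUTLET_CODE, PRODUCT_CODE, TRANS_DATE, SALES_VALUE).
# A real check would compare all 18 columns.
rows = [
    ("OL65494", "PRD0028", "2023-10-01", 8.93),
    ("OL65494", "PRD0028", "2023-10-01", 8.93),  # same record again
    ("OL12036", "PRD0147", "2023-11-10", 142.86),
    ("OL49989", "PRD0016", "2023-11-09", 518.18),
]


def duplicate_groups(records):
    """Map each record that occurs more than once to its occurrence count."""
    counts = Counter(records)
    return {record: n for record, n in counts.items() if n > 1}


groups = duplicate_groups(rows)
for record, n in groups.items():
    print(record, "occurs", n, "times")
```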